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1.  Introduction 

A  large  number  of  proteins  that  are  relevant  in  understanding  the  biological 
processes  are  expressed  at  low  levels  in  the  system.  Therefore,  there  is  a  need  for  highly 
sensitive,  high  throughput  methods  to  analyze  a  wide  dynamic  range  of  proteins. 

Our  group  has  demonstrated  that  high-throughput  proteomic  approaches  for  “protein 
profiling”  has  tremendous  potential  for  identifying  biomarkers  to  improve  cancer 
diagnosis  (Adam  et  al.  2002,  Qu  et  al.  2002,  Malik  et  al.,  2005). 

In  order  to  improve  the  ability  to  “mine”  the  full  depth  of  the  proteome,  Bruker 
Daltonics  has  newly  introduced  the  UltraFlex™  MALDI-TOF/TOF  instrument  equipped 
with  ClinProt,  a  robotic  bead-based  sample  processing  station.  The  instrument  provides 
improved  ability  to  mine  deeper  into  the  proteome,  improved  resolution/accuracy  and  the 
ability  to  achieve  peptide/protein  identification  (Suckau  et  al.  2003). 

2.  The  Specific  Aims  of  our  original  application  were- 

AIM 1.  Serum  cohort  to  identify  prostate  cancer  (PCa)  population  with  minimal 
clinical  symptoms. 

AIM  2.  Discovery  of protein  biomarkers  for  the  early  detection  of  PCa  in  cohort. 

AIM  3.  Isolation  and  identification  of  the  protein  biomarkers. 

AIM  4.  Development  of  MS-assisted  immunoassay  for  PCa  diagnostics. 


3.  Key  Research  Accomplishments 

3.A.  Serum  cohort  to  identify  prostate  cancer  (PCa)  population  with  minimal  clinical 
symptoms.  Our  studies  are  directed  at  the  male  population  that  present  with  marginal 
symptoms  (such  as  low  PSA  levels  and/or  positive  DRE)  and  who  undergo  biopsy.  We 
have  identified  and  collected  serum  specimens  from  185  patients  with  positive  prostate 
biopsy.  We  have  also  collected  a  set  of  223  serum  samples  from  patients  with  negative 
biopsy.  The  samples  are  stored  at  -70°C  in  small  aliquots  ready  to  be  used  for  this  study. 
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Only  pretreatment  samples,  obtained  at  the  time  of  diagnosis  of  prostate  cancer, 
were  collected  for  use  in  this  study.  All  samples  were  obtained  from  properly  consented 
patients  through  the  institutional  review  board  approved  protocols. 

3.B.  Serum  protein  profiling  for  PCa  diagnostics.  In  our  original  application,  we 
proposed  to  establish  the  clinical  utility  of  high-throughput  proteomic  approaches  to 
protein  profding.  In  addition,  we  also  proposed  to  extend  the  proof-of-concept  of  the 
utility  of  mass  spectrometry  based  approaches  to  specific  early  detection  objectives  in 
PCa  by  applying  these  proven  approaches  to  the  characterization  and  sequence 
identification  of  promising  biomarkers  for  detecting  early  cancer. 

Since,  the  proteomes  of  complex  body  fluids  like  serum  have  biomarkers  spread 
over  a  wide  range  of  concentrations  (Sernmes  et  al.  2006),  the  key  to  biomarker  discovery 
lies  in  effective  sample  preparation  prior  to  MS-based  analyses.  We  therefore  spent  a 
significant  portion  of  our  efforts  in  optimizing  the  strategies  to  enrich  low  abundant 
proteins  to  allow  for  their  identification  by  mass  spectrometry.  To  achieve  this  objective, 
we  have  been  developing  methodologies  for  up-front  fractionation  of  serum  for  MALDI 

profiling.  Initially  we  assessed  two  parameters,  which 
fractionation  strategy  provides  the  most  differential 
capture  of  proteins  between  case  and  control,  and 
whether  serum  depletion  prior  to  fractionation 
improves  the  detection  of  these  proteins.  We  utilized 
the  magnetic  bead-based  approaches  of  fractionation 
for  automation  of  the  techniques  on  the  ClinProt 
robotic  workstation.  The  employment  of 
functionalized  magnetic  bead  based  techniques  in 
conjunction  with  mass  spectrometry  combines  short 
processing  times  and  automatic  workflows  with  high- 
resolution  analyses. 


Figure  1.  Proteins  from  quality 


control  (QC)  serum  were  captured  by 
IMAC  (left)  or  ConA  lectin  (right) 
magnetic  beads.  Neat  serum  (load), 
flow-through  (unbound)  and  eluate 
(bound)  fractions  were  analyzed  by 
SDS-PAGE. 
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One  of  our  approaches  utilized  the  application  of  lectin  capture  to  enrich  for 
glycosylated  forms  of  proteins  in  sera  from  PCa  subjects.  The  lectin  affinity  approach 
offers  several  distinct  biochemical  advantages  toward  protein  fractionation. 

Glycosylation  is  with  at  least  50%  the  most  common  form  of  post-translational 
modification  of  proteins  and  the  degree  and  type  of  glycosylation  depend  on  the  status  of 
cells,  thus  linking  them  to  certain  diseases  (Dube  and  Bertozzi  2005).  Besides,  many 
major  serum  proteins,  in  particular  albumin,  are  not  glycosylated  and  are  therefore  not 
bound  efficiently  by  lectins,  offering  a  concomitant  decrease  in  complexity  of  the  serum 
protein  targets  (Figure  1).  There  are  many  lectin  types  available  commercially  in  pure, 
bead-based  fonns  which  allows  for  automation  of  assays  in  the  front-end  to  mass 
spectrometry  and  thereby  facilitating  high  through-put  analyses  (Sparbier  et  al.  2005, 
2006). 

The  biological  affinity  of  lectins  offers  multiple  uses  and  strategies  to  maximize 
the  information  gained  from  precious  clinical  samples.  Our  group  has  demonstrated  in  a 
proof-of-concept  study,  the  specific  enrichment  of  glycosylated  peptides  and  proteins 
from  PCa  sera  by 
lectin  affinity 
chromatography 
supported  by 
functionalized 
magnetic  particles.  I 
am  a  co-author  in  this 
research  that  has  been 
published  this  year 
(Drake  et  al.  2006). 

Reprint  of  the  article  has  been  attached  at  the  end  of  this  report. 


Figure  2.  Experimental  flow  chart  for  protein  profiling  using  mass  spectrometry. 
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Our  approach  to  utilize  the  lectin  capture  strategy  for  high-throughput  serum 
expression  profiling  is 
summarized  in  figure  2.  This 
involves  capture  of  the  intact 
glycoproteins  by  lectins, 
followed  by  trypsin  digestion 
to  generate  peptides  and  final 
affinity  separation  of  peptides 
prior  to  MALDI-TOF  and/or 
tandem  mass  spectrometry 
analyses.  All  of  these  steps  can 
be  fully  automated  using  bead- 
based  supports  coupled  to 
liquid  sample  handling  robotics.  We  propose  to  utilize  the  lectin  affinity  of  Concanavalin 
A  (ConA)  coated  magnetic  beads,  which  can  be  applied  as  a  general  tool  for  capturing  of 
A-glycosylated  peptides  and  proteins  with  broad  specificity.  The  isolated  glycoproteins 

will  be  digested  with  trypsin  and 
the  resulting  tryptic  peptides 
purified  using  ClinProt  MB-IMAC 
Cu  beads. 

Each  fraction  will  be 
analyzed  by  MALDI-TOF  MS  in 
both  the  linear  and  reflector  mode. 
Suitable  peptide  peaks  can  then  be 
analyzed  by  MALDI-TOF/TOF 
MS  to  identify  the  corresponding 
proteins  by  database  search.  For 
comparison,  the  same  serum  will 
be  purified  by  metal  affinity  beads 
(MB-IMAC  Cu)  without  trypsinization.  The  eluates  will  be  directly  analyzed  by  MALDI- 
TOF  MS  (Figure  3).  Spectra  can  also  be  acquired  in  a  much  wider  mass  range  in  the 


Trypsin  Digested 


Figure  4.  Pooled  sera  from  benign  prostatic  hyperplasia  (BPH) 
and  prostate  cancer  (PCa)  patients  were  bound  to  ConA  lectin.  A 
portion  of  bound  proteins  were  separated  on  ID  SDS-PAGE  (A.), 
either  undigested  (lanes  1,2)  or  digested  with  trypsin  (lanes  3,4). 
(B.)  The  same  eluates,  trypsin  digested  (upper)  or  undigested 
(lower),  were  incubated  with  MB-IMAC  Cu  beads  and  analyzed 
on  Ultraflex™  in  linear  mode.  Reprinted  from  Drake  RR  et  al. 
Mol.  Cell.  Proteomics.  2006  Jun  7. 


Figure  3.  Diluted  quality  control  serum  was  digested  with  trypsin  at 
various  enzyme  to  substrate  ratios  (Enz:Subs).  Digested  peptides 
were  purified  using  ClinProt  MB-IMAC  Cu  beads  and  analyzed  by 
MALDI-TOF  MS  in  linear  mode.  Undigested  neat  (undiluted)  serum 
was  run  directly  on  ClinProt  MB-IMAC  Cu  as  control. 
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linear  mode  by  applying  different  matrices  such  as  CHCA,  SPA,  HCCA,  2,5-DHAP  to 
the  eluates. 

Our  approach  integrates  a  high-resolution  separation  of  digest-generated  peptides 
with  increasingly  sophisticated  mass  spectrometry  for  “bottom-up”  differential 
identification.  This  approach,  utilizing  primarily  fractionation  of  peptides  generated  from 
complex  protein  mixtures  promises  a  true  reflection  of  the  native  proteins  and  that  the 
peptides  behave  more  uniformly  in  both  fractionation  and  detection.  Digest-generated 
peptides  are  then  subjected  to  mass  spectrometry  analysis  (Figure  4). 

3.C.  Isolation  and  identification  of  the  protein  biomarkers.  We  proposed  to  use  the 
UltrFlex™  TOF/TOF  mode  for  highly  sensitive  and  accurate  tandem  mass  spectrometry 
for  peptide  mass  fingerprints  (PMF).  When  operated  in  the  TOF/TOF  mode  the 
UltraFlex™  achieves  very  high  resolution,  accuracy  and  signal  to  noise  and  effective 
tandem  mass  spectrometry  for  protein  identification  (Figure  5). 

Initial  analysis  of 
sample  fractions  in  the 
linear  mode  will  allow 
for  visualization  of  the 
target  peak.  When  the 
peak  has  been  targeted, 
identification  is  achieved 
with  the  UltraFlex™ 
which  employs  ion 
potential  lift  (LIFT)  in  a 
MALDI-TOF/T  OF 
platform  for  highly 
sensitive  (attomolar 
range)  and  accurate 
tandem  mass 

spectrometry  for  peptide  mass  fingerprints  (PMF).  Depending  on  the  size  of  the  target 
peak  we  will  use  trypsinized  fractions  in  combination  with  Laser  Induced  Dissociation 


Figure  5.  Pooled  quality  control  serum  was  digested  with  Trypsin  and 
analyzed  by  MALDI-TOF  MS  after  purification  with  ClinProt  MB-IMAC  Cu 
beads.  The  sample  fractions  were  run  on  both  Linear  (upper)  and  Reflector 
(lower)  mode. 
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(LID)  for  most  protein  identification  and  Collision  Induced  Dissociation  (CID)  for  de 
novo  sequencing  and  resolving  Leu/Ile  ambiguity.  On  the  MALDI  platfonn  these  can  be 
performed  serially  on  the  same  sample  (see  Suckau  et  al.  2003  for  details). 

A  major  advantage  to  the  combined  ClinProt  UltraFlex™  system  is  the  ability  to 
directly  scale  up  for  isolation  and  purification  prior  to  applying  sequence  identification 
efforts.  Our  front-end  fractionation  strategies  will  help  scale  up  the  amount  and  purity  of 
the  sample  to  facilitate  success  in  the  tandem  mass  operation. 

3. D.  Development  of  MS-assisted  immunoassays  for  PCa  diagnostics.  The  ability  of  the 
identified  and  validated  biomarkers  to  diagnose  prostate  cancer,  especially  in  sample 
groups  where  PSA  fails  to  detect  cancer  (clinical  gray  area),  would  be  tested  using  large 
sample  sets  on  MALDI  and  SELDI-based  immunoassays. 

4.  REPORTABLE  OUTCOMES 

4.A.  Serum  samples  from  223  patients  with  negative  prostate  biopsy  and  185  patients 
with  positive  prostate  biopsy  in  the  clinical  gray  area  of  diagnosis 
(PSA<4.0ng/ml;  Abnormal  DRE  and/or  elevated  PSA  etc.)  are  collected  and 
stored  in  our  serum  repository  for  use  in  this  study. 

4.B.  Initial  evaluation  of  the  pre-fractionation  of  serum  samples  prior  to  MALDI-TOF 
MS  has  been  performed.  Based  on  the  results  of  the  initial  pilot-experiments, 
bead-based  lectin  capture  prior  to  MS  analysis  generated  the  best  outcome  and 
will  be  performed  in  this  study.  Results  of  this  research  are  in  press  (Drake  et  al. 
Mol.  Cell.  Proteomics  June  2006).  Manuscript  is  attached  at  the  end  of  the  report. 

4.C.  For  protein  identification,  analysis  of  tryptic  peptides  generated  from  serum  were 
analyzed  my  MALDI  in  the  TOF/TOF  mode.  A  review  of  various  MS  techniques 
for  prostate  cancer  diagnostics  was  reported  this  year  in  Semmes  et  al. ,/.  Cell. 
Biochem.  June  2006,  in  which  I  am  a  co-author.  Manuscript  is  attached  at  the  end 
of  the  report. 

4.D.  During  my  postdoctoral  training  here  at  EVMS,  I  was  offered  a  position  at  the 
Cancer  Therapy  and  Research  Center  (CTRC)  Institute  for  Drug  Development 
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(IDD)  in  San  Antonio,  Texas.  I’ll  be  joining  there  as  a  Senior  Research  Associate 
from  Oct  2,  2006. 
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Application  of  Mass  Spectrometry  to  the  Discovery  of 
Biomarkers  for  Detection  of  Prostate  Cancer 

O.  John  Semmes,*  Gunjan  Malik,  and  Mike  Ward 

Department  of  Microbiology  and  Molecular  Cell  Biology,  Center  for  Biomedical  Proteomics, 

Virginia  Prostate  Center,  Eastern  Virginia  Medical  School,  Norfolk  Virginia 


Abstract  There  has  been  an  impressive  emergence  of  mass  spectrometry  based  technologies  applied  toward  the 
study  of  proteins.  Equally  notable  is  the  rapid  adaptation  of  these  technologies  to  biomedical  approaches  in  the  realm  of 
clinical  proteomics.  Concerted  efforts  toward  the  elucidation  of  the  proteomes  of  organ  sites  or  specific  disease  state  are 
proliferating  and  from  these  efforts  come  the  promise  of  better  diagnostics/prognostics  and  therapeutic  intervention. 
Prostate  cancer  has  been  a  focus  of  many  such  studies  with  the  promise  of  improved  care  to  patients  via  biomarkers  derived 
from  these  proteomic  approaches.  The  newer  technologies  provide  higher  analytical  capabilities,  employ  automated 
liquid  handling  systems,  fractionation  techniques  and  bioinformatics  tools  for  greater  sensitivity  and  resolving  power, 
more  robust  and  higher  throughput  sample  processing,  and  greater  confidence  in  analytical  results.  In  this  prospects,  we 
summarize  the  proteomic  technologies  applied  to  date  in  prostate  cancer,  along  with  their  respective  advantages  and 
disadvantages.  The  development  of  newer  proteomic  strategies  for  use  in  future  applications  is  also  discussed.  J.  Cell. 
Biochem.  98:  496-503,  2006.  ©  2006  Wiley-Liss,  Inc. 
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Prostate  cancer  is  the  most  common  non-skin 
cancer  in  the  US.  In  2005,  more  than  200,000 
men  have  been  diagnosed  with  prostate  cancer, 
and  over  30,000  men  will  die  from  it,  making  it 
the  second-most  leading  cause  of  cancer-related 
deaths  among  men  in  the  US.  If  diagnosed  early, 
prostate  cancer  can  be  effectively  treated  by 
surgery  or  radiation.  However,  every  year, 
70,000  men  require  additional  treatment  due 
to  recurrence  of  the  disease.  Prostate  cancer  is  a 
complex  heterogeneous  disease  that  acts  differ¬ 
ently  in  different  men.  The  slow  rate  of  prostate 
cancer  growth,  coupled  with  the  widely  varied 
presentation,  has  made  it  difficult,  if  not 
impossible,  to  determine  conclusively  which 
treatment  is  best  for  which  man. 
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Early  prostate  cancer  usually  has  no  symp¬ 
toms  and  is  most  commonly  detected  through 
prostate  cancer  screening  tests  such  as  the 
prostate  specific  antigen  (PSA)  blood  test  and 
digital  rectal  exam.  An  elevated  PSA  level  in  the 
bloodstream  does  not  necessarily  indicate  pros¬ 
tate  cancer,  since  PSA  levels  can  be  altered  by 
infection  or  other  prostate  conditions  such  as 
benign  prostatic  hyperplasia  (BPH).  Although 
the  standard  PSA  test  remains  the  most  widely 
used  screening  assay  for  prostate  cancer, 
approximately  25%  of  men  with  prostate  cancer 
have  a  PSA  level  below  4.0  ng/ml  and  only  25% 
of  men  with  a  PSA  level  of  4—10  ng/ml  have 
prostate  cancer.  Indeed  significant  numbers  of 
men  with  an  elevated  PSA  do  not  have  prostate 
cancer.  Thus  there  is  a  need  for  more  accurate 
and  non-invasive  techniques  to  detect,  diag¬ 
nose,  and  stratify  the  disease  based  on  molecu¬ 
lar  markers  present  in  the  body  fluids. 

WHY  PROTEOMICS? 

Most  of  the  physiological  changes  in  cancer 
are  mediated  by  molecular  alterations  at  the 
protein  level  many  of  which  would  not  be 
expected  to  be  revealed  at  the  DNA/RNA  level. 
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Disease  specific  changes  arising  from  the  tumor 
cell  or  microenvironment  can  be  utilized  to 
provide  biomarkers  that  can  guide  treatment 
decisions  at  the  molecular  level.  These  biomar¬ 
ker  proteins  can  be  uncovered  by  comprehen¬ 
sive  protein  analysis  of  cells,  tissues,  and  body 
fluids  (like  blood,  seminal  plasma,  and  prostatic 
fluid)  as  well  as  artificially  generated  animal 
models  and  cell  lines.  Blood  has  been  a  particu¬ 
larly  attractive  target  proteome  source  because 
cellular  biomarkers  routinely  “leak”  into  the 
body  fluids.  In  addition,  blood  is  easy  to  handle, 
and  acquisition  is  non-invasive  and  this  pro¬ 
teome  likely  harbors  a  true  picture  of  the 
physiological  state  of  the  patient. 

Potential  proteomic  biomarkers  of  prostate 
cancer  can  not  only  benefit  in  earliest  detection 
of  disease  but  can  also  be  used  for  determining 
cancer  risk,  stratifying  disease  stage  and 
grade,  monitoring  response  to  therapy,  and  in 
general  assisting  in  therapeutic  decision  mak¬ 
ing.  Through  careful  sample  selection,  proper 
study  design,  automation  in  sample  handling 
and  processing,  proteomic  platforms  are  fast 
becoming  very  powerful  tools  in  prostate  cancer 
research. 

APPROACHES  TO  CLINICAL  PROTEOMICS 

Proteomic  studies  dating  from  the  1970s 
utilized  the  technique  of  two-dimensional  gel 
electrophoresis  to  display  a  large  number  of 
proteins  from  a  given  cell-line  or  organism 
[O’Farrell,  1975].  The  technique  works  as  a 
powerful  tool  for  comparative  analyses  of  pro¬ 
tein  expression  levels  between  samples.  How¬ 
ever  it  soon  became  clear  that  this  approach  was 
limited  in  application  with  respect  to  the  needs 
of  clinical  proteomics.  The  resolving  power  is 
limited  by  mass  and  pi,  the  technique  is  not 
high-throughput,  the  platform  is  limited  in 
reproducing  similar  2-D  patterns  and  needs 
larger  amounts  of  samples  for  processing.  In 
response  to  these  limitations  researchers  have 
incorporated  fluorescent  dyes  in  a  process 
termed  2-D  differential  in-gel  electrophoresis 
or  DIGE  [Unlu  et  al.,  1997],  to  differentially 
label  proteins  from  multiple  sources  and  ana¬ 
lyze  the  patterns  of  each  on  the  same  2-D  gel. 
The  technique  allows  for  increased  throughput 
and  easier  comparative  expression  analysis  bet¬ 
ween  samples.  Although,  these  recent  advances 
in  staining  techniques  using  fluorescent  dyes, 
along  with  the  use  of  pre-fractionation  approa¬ 


ches  [Van  den  Bergh  et  al.,  2003]  and  narrower 
pH  ranges  in  the  first  dimension  along  with 
large  format  gels  [Gorg  et  al.,  2002]  are  improv¬ 
ing  the  sensitivity  and  effectiveness  (reviewed 
by  [Lilley  and  Friedman,  2004]),  nevertheless  a 
need  remains  for  high-throughput  applications 
capable  of  simultaneously  assessing  the  pro¬ 
teome  of  population-representative  sample  sets. 

Combinatorial  approaches  include  a  combi¬ 
nation  of  pre-fractionation  and  gel  electrophor¬ 
esis  with  mass  spectrometry  techniques.  The 
approach  has  been  utilized  for  proteomic  ana¬ 
lysis  of  human  prostate  cancer  [Nelson  et  al., 
2000;  Ahram  et  al.,  2002;  Meehan  et  al.,  2002]. 
Using  a  combination  of  laser  capture  micro¬ 
dissection,  2-D  PAGE  followed  by  LC  MS/MS 
analysis  of  the  tryptic  digests  of  the  protein 
spots,  Ahram  et  al.  [2002]  identified  40  tumor 
specific  protein  expression  changes.  With  the 
combination  of  2-D  PAGE,  MALDI-TOF  MS, 
peptide  mass  fingerprinting  (PMF)  and  N- 
terminal  protein  sequencing,  Meehan  et  al. 
[2002]  identified  20  protein  alterations  in 
prostate  tissues  and  validated  them  by  Western 
blotting  and  immuno-histochemistry  (IHC). 
Similarly,  the  combination  of  cDNA  microar¬ 
rays,  2-DE  and  MS  has  been  employed  to 
generate  global  gene/protein  expression  profiles 
of  androgen-stimulated  prostate  cell  lines  by 
Nelson  et  al.  [2000]. 

Even  more  recently  the  numbers  of  technical 
approaches  available  to  proteomic  analysis  are 
proliferating  at  a  staggering  pace;  to  the  extent 
that  evaluating  the  merit  of  individual  approa¬ 
ches  has  become  a  top  priority  as  evidenced  by 
several  recent  requests  for  applications  issued 
by  the  National  Cancer  Institutes.  Among  the 
many  technical  hurdles  to  successful  proteome 
mining,  the  two  most  prominent  are  likely 
the  daunting  numbers  of  different  protein 
entities  and  the  existing  range  of  protein  con¬ 
centration.  Estimates  of  potential  protein  types 
can  reach  into  the  millions  when  considering 
post-translational  modification  events  and  the 
relative  concentration  range  can  span  12  orders 
of  magnitude.  Thus,  researchers  must  contend 
with  achieving  utility  in  both  resolution  and 
sensitivity  of  a  given  technique.  An  additional 
level  of  complexity  exists  when  one  considers 
the  heterogeneity  of  individuals,  which  is  a 
significant  confounding  factor  in  the  study 
design  of  successful  biomarker  efforts.  Solu¬ 
tions  to  these  issues  have  come  in  the  form  of  so 
called  “top  down”  proteomics  in  which  whole 
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proteins  are  separated  and  case  versus  control 
differential  established  prior  to  mass  spectro¬ 
metry  analysis.  Alternatively,  complex  protein 
mixtures  can  be  enzymatically  digested  prior  to 
separation  and  differential  expression  determi¬ 
nation  using  mass  spectrometry  in  “bottom  up” 
approaches.  This  latter  approach  was  enabled 
via  improved  tandem  mass  spectrometry  ad¬ 
vances.  In  ah  of  these  conceptual  approaches 
(see  Fig.  1),  “front-end”  sample  fractionation 
and  separation  strategies  are  required  to  reduce 
the  complexity  of  native  clinical  samples  (or  cell 
lysates)  and  the  technical  improvements  in  this 
area  have  grown  as  well. 
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SCREENING  ALTERNATIVES  TO 
MS-BASED  PROTEOMICS 

There  are  several  very  exciting  approaches  to 
high-throughput  screening  proteomics  approa¬ 
ches  that  will  not  be  discussed  in  detail  here. 
These  include  antibody  arrays  (reviewed  by 
[Haab,  2005])  which  have  had  and  will  continue 
to  have  significant  applications  in  cancer  re¬ 
search.  The  antibody-arrays  have  been  utilized 
for  protein  profiling,  biomarker  identification, 
protein  characterization  and  in  some  cases  the 
detection  of  protein  post-translational  modifica¬ 
tions.  Some  notable  success  stories  of  interest  to 
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Fig.  1.  Two  major  approaches  to  clinical  proteomics.  A:  In  the 
Top-down  proteomics  approach,  whole  proteins  are  pre¬ 
fractionated  via  various  gel  and  non-gel  based  techniques.  There 
are  obvious  scientific  advantages  for  determining  protein 
complexes  and  post-translational  modifications  when  employ¬ 
ing  these  approaches.  Intact  proteins  of  interest  are  then 
subjected  to  subsequent  MS-based  analyses  using  either  single 
or  tandem  mass  spectrometry.  Throughput  is  usually  indirectly 
proportional  to  the  amount  of  information  gathered.  B:  The 
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bottom-up  proteomics  approach  utilizes  primarily  non-gel  based 
fractionation  of  peptides  generated  from  complex  protein 
mixtures.  The  rationale  is  that  the  digested  peptides  will  reflect 
the  native  proteins  and  that  the  peptides  behave  more  uniformly 
in  both  fractionation  and  detection.  Digest-generated  peptides 
are  then  subjected  to  mass  spectrometry  analysis.  The  approach 
can  involve  simple  peptide  mass  profiling  as  well  as  quantitative 
tandem  mass  spectrometry  to  yield  protein  identification  and 
relative  protein  concentration. 
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this  audience  would  be  in  the  analysis  of 
proteins  in  tumors  [Knezevic  et  al.,  2001],  the 
specific  analysis  of  the  prostate  cancer  humoral 
antibody  response  in  patients  to  antigens  [Webb 
etal.,  1981;  Wang  et  al.,  2005a]  as  well  as  serum 
protein  expression  profiling  [Miller  et  al.,  2003]. 
The  development  of  tissue  microarray  (TMA) 
technology  [Kononen  et  al.,  1998]  has  initiated 
large-scale  studies  using  tumor  tissues.  The 
technique  has  also  been  widely  extended  to 
prostate  cancer  studies  (reviewed  by  [Kuefer 
et  al.,  2004;  Watanabe  et  al.,  2005])  specifically 
for  protein  expression  profiling,  biomarker 
validation  [Rubin  et  al.,  2002;  Zellweger  et  al., 
2003],  and  study  of  tumor  biology  [Sun  et  al., 
2003;  Boddy  et  al.,  2005;  Chuan  et  al.,  2005]. 
TMA  technology  has  considerable  value  in 
translating  the  information  gained  from  initial 
discovery  into  clinical  applications.  Protein 
arrays  have  been  used  to  detect  antibodies  in 
samples  against  a  set  of  cancer  antigens 
[Robinson  et  al.,  2002].  This  approach  may  also 
involve  the  arraying  of  uniquely  designed 
antigens  and  has  been  successfully  applied  in 
prostate  cancer  studies  using  patient  samples 
or  cancer  models  [Lagarkova  et  al.,  2003; 
Nishizuka  et  al.,  2003;  Zhang  et  al.,  2003]. 
Thus,  it  is  essential  that  the  value  of  these 
approaches  not  be  lost  in  the  focus  of  this  review 
on  the  emerging  mass  spectrometry  based 
techniques. 

MS-BASED  APPROACHES  TO 
CLINICAL  PROTEOMICS 

Top-down:  The  highly  complex  and  wide 
dynamic  range  of  proteins/peptides  in  body 
fluids  needs  high-resolution  systems  for  bio¬ 
marker  mining.  Moreover,  in  complex  body 
fluids  like  serum,  the  biomarkers  could  be 
spread  over  a  wide  range  of  concentrations. 
One  of  the  ways  to  ease  the  “mining”  of 
biomarkers  in  complex  proteomes  is  via  the 
separation  of  whole  proteins  prior  to  MS-based 
analyses.  Two-dimensional  liquid  chromatogra¬ 
phy-based  technologies  (2D-LC)  are  the  most 
widely  used  techniques  for  the  this  type  of 
approach  [Yan  et  al.,  2003;  Kolch  et  al.,  2005; 
Qin  et  al.,  2005].  Pre-fractionation  of  samples  in 
liquid  phase  prior  to  biomarker  mining  not  only 
reduces  the  proteome  complexity  of  body  fluids 
like  serum  but  also  allows  for  automation  of 
sample  processing  before  the  analyses  of  the 
fractions.  Capillary  electrophoresis  coupled  to 


mass  spectrometry  has  also  been  utilized  in 
several  studies  for  a  high  resolution  fast 
separation  of  complex  fluids  like  urine  [Chal¬ 
mers  et  al.,  2005;  Fliser  et  al.,  2005]. 

Protein  expression  profiling  using  either 
MALDI-TOF  or  SELDI-TOF  approaches  has 
seen  a  wide  application  to  many  disease  sites 
including  prostate  cancer  [Peter  et  al.,  2001; 
Adam  et  al.,  2002;  Cazares  et  al.,  2002;  Petricoin 
et  al.,  2002;  Qu  et  al.,  2002;  Banez  et  al.,  2003; 
Kidd  et  al.,  2003;  Lehrer  et  al.,  2003].  The 
SELDI-TOF  approach  utilizes  a  chip-based 
affinity  capture  procedure  to  reduce  sample 
complexity  and  then  “profiles”  the  bound  intact 
proteins  by  mass.  The  technique  is  sensitive, 
needs  minimal  amount  of  protein,  and  is  rela¬ 
tively  high  throughput  (reviewed  by  [Wright, 
2002;  Conrads  et  al.,  2004;  Solassol  et  al.,  2005]). 
Our  laboratory  and  others  have  been  employing 
a  combination  of  chromatographic  paramag¬ 
netic  beads  and  MALDI  TOF/TOF  MS  to 
present  a  powerful  and  sensitive  analysis  of 
pre-fractionated  samples  (reviewed  by  [Pusch 
and  Kostrzewa,  2005]).  The  paramagnetic 
beads  allow  for  reasonable  high  throughput 
processing  and  reproducible  fractionation  of 
proteins/peptides  followed  by  MALDI-TOF  MS 
analysis  (Fig.  2).  Since  the  introduction  of  this 
technology  to  the  field,  the  technique  has  been 
widely  used  for  single  or  multidimensional 
separation  of  proteins/peptides  on  the  beads. 
The  fractions  are  then  spotted  on  target  plates 
for  MALDI-TOF  analysis  [Villanueva  et  al., 
2004].  Although  not  yet  fully  realized,  this 
approach  via  sophisticated  TOF/TOF  capabil¬ 
ities  offers  direct  protein  identification  with 
little  or  no  additional  work-up.  We  have  been 
particularly  interested  in  the  utility  of  this 
instrumentation  in  improving  so  called  immuno- 
MS  (Fig.  2)  which  is  an  approach  that  we  first 
reported  on  in  early  2000  using  SELDI-TOF. 
The  incorporation  of  immuno-MS  provides  for 
early  validation  of  biomarkers  discovered  on  the 
same  platform  and  offers  distinct  advantage 
over  ELISA  in  that  isoforms,  modifications  and 
cleavage  products  can  be  evaluated  with  the 
same  antibody. 

Bottom  up:  Various  non-gel  based  liquid 
chromatography  techniques  focusing  on  pep¬ 
tides  are  gaining  attention,  as  they  allow  multi¬ 
dimensional,  automated  separation  of  peptides 
representing  very  low  abundance  of  proteins. 
The  capabilities  of  these  techniques  to  perform 
proteome  analysis  from  minimal  samples  has 
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Fig.  2.  Introduction  of  automated  high-resolution  MALDI-TOF 
based  approaches  (A)  Using  a  combination  of  affinity  selection 
on  paramagnetic  beads  and  downstream  mass  spectrometry, 
a  high-resolution  and  high-throughput  MALDI-TOF  based 
approach  has  been  developed.  The  technique  utilizes  the 
affinity-capture  of  proteins/peptides  from  complex  fluids  using 
a  variety  of  capture  molecules.  The  bound/unbound  protein 
fractions  can  then  be  spotted  on  a  target  plate  and  analyzed  by 


MALDI-TOF  MS.  The  procedure  profoundly  reduces  the  sample 
complexity  and  then  "profiles"  the  bound  intact  proteins  by 
mass.  B:  The  automated  paramagnetic  bead  system  can  also  be 
used  in  combination  with  differential  protein  labeling  for  a 
quantitative  MS  analysis.  Using  mass-shift  and  mass-defect 
tagging  of  protein  mixtures  from  different  sources,  the  high 
resolution  MALDI-TOF/TOF  approach  allows  for  direct  protein 
quantitation  and  identification. 


generated  new  prospects  for  biomarker  discov¬ 
ery  especially  using  selected  cell  populations 
from  tissue  specimens  [Wang  et  al.,  2005b].  A 
high-resolution  chromatographic  separation  of 
digest-generated  peptides  prior  to  mass  spectro¬ 
metry  analysis  without  the  involvement  of 
gel  electrophoresis  has  potential  application  to 
clinical  proteomics.  Such  micro  fluidic  systems 
have  already  been  integrated  with  increasingly 
sophisticated  mass  spectrometry  for  bottom-up 
differential  identification  [Brivio  et  al.,  2002;  Li 
et  al.,  2002;  Zhu  et  al.,  2003;  Metodiev  et  al., 
2004].  Peptide  quantification  using  a  combina¬ 
tion  of  multidimensional  liquid-chromatogra¬ 
phy,  protein  labeling  and  digestion  has  also 
been  reported  recently  [Gygi  et  al.,  1999;  Griffin 
et  al.,  2003;  DeSouza  et  al.,  2005]  adding  direct 
quantitation  and  thus  making  it  a  much  more 


powerful  tool.  The  technique  of  differential 
peptide  display  (DPD)  has  been  recently  uti¬ 
lized  to  analyze  the  peptidome  of  the  HUPO 
human  serum  and  plasma  specimens  [Tammen 
et  al.,  2005].  The  samples  were  fractionated  on 
RP-HPLC  and  each  fraction  is  applied  to 
MALDI-TOF  MS  to  generate  an  in  silico  2-D 
display  of  peptide  masses.  A  combinatorial 
approach  of  protein  fractionation  using  HPLC, 
tryptic  digestion  and  RPLC-MS/MS  has  also 
been  recently  utilized  to  characterize  the  mouse 
serum  proteome  [Hood  et  al.,  2005].  Using  these 
techniques  the  group  identified  12,300  unique 
peptides  originating  from  4567  unique  mouse 
serum  proteins. 

A  number  of  groups  are  now  trying  to  identify 
and  analyze  proteins  from  less  complex  mix¬ 
tures  such  as  seminal  fluids  [Utleg  et  al.,  2003; 
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Fung  et  al.,  2004],  laser  captured  cells  from 
cancer  tissues  [Paweletz  et  al.,  2001;  Cazares 
et  al.,  2002;  Diaz  et  al.,  2004],  albumin-asso¬ 
ciated  proteins  from  blood  sera  [Lowenthal 
et  al.,  2005],  glycosylated  proteins  [Manning 
et  al.,  2004;  Yang  and  Hancock,  2005;  Yang 
et  al.,  2005],  or  sub  cellular  fractions  from 
cancer  cells  [Gretzer  et  al.,  2004].  The  up-front 
reduction  in  sample  complexity  helps  to  reduce 
the  numbers  of  proteins  being  interrogated  thus 
effectively  increasing  the  coverage  of  the  dis¬ 
ease  proteome. 

Clearly  there  is  a  demand  for  enabling  the 
adaptation  of  cutting-edge  mass  spectroscopy 
approaches  to  clinical  proteomics.  These  solu¬ 
tions  will  likely  focus  on  improving  sample 
acquisition  and  handling,  reducing  sample 
complexity,  increasing  sample  throughput, 
and  improving  sensitivity/resolution  in  ion 
detection.  It  is  also  clear  that  this  need, 
delineated  by  the  pioneering  work  of  a  handful 
of  clinical  proteomics  laboratories,  has  been 
noted  by  the  mainstream  mass  spectrometry 
community.  However,  technology  alone  cannot 
drive  future  success  in  the  application  of 
proteomics  to  prostate  cancer.  Clearly,  a  con¬ 
certed  multi-disciplinary  effort  is  needed.  Cen¬ 
tral  to  this  collaboration  is  the  biochemist  with  a 
greater  understanding  of  protein  behavior  and 
the  tools  to  tease  proteins  from  the  proteome.  In 
fact  the  advances  in  the  application  of  mass 
spectrometry  to  proteins  should  signal  a  renais¬ 
sance  in  classic  biochemistry,  an  expertise  that 
had  largely  given  way  to  molecular  biology. 
Studies  involving  mouse  models,  cell  lines  and 
direct  human  samples  need  to  be  coordinated 
toward  the  same  clinical  goals.  For  example, 
uncovering  the  proteome  changes  associated 
with  exposure  of  LNCaP  cells  to  androgen 
[Meehan  and  Sadar,  2004],  would  compliment 
nicely  with  similar  studies  in  androgen  resis¬ 
tant  mouse  models  and  proteomic  analysis  of 
prostatic  fluids  from  patients  with  androgen- 
resistance  transition.  In  the  end,  study  design 
and  solid  biochemistry  will  push  the  success 
envelope  of  new  technologies  in  clinical  pro¬ 
teomics. 
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ABSTRACT 


The  application  of  mass  spectrometry  to  identify  disease  biomarkers  in  clinical 
fluids  like  serum  using  high-throughput  protein  expression  profding  continues  to  evolve 
as  technology  development,  clinical  study  design,  and  bioinfonnatics  improve.  Previous 
protein  expression  profiling  studies  have  offered  needed  insight  into  issues  of  technical 
reproducibility,  instrument  calibration,  sample  preparation,  study  design  and  supervised 
bioinfonnatics  data  analysis.  In  this  overview,  new  strategies  to  increase  the  utility  of 
protein  expression  profiling  for  clinical  biomarker  assay  development  are  discussed,  with 
an  emphasis  on  utilizing  differential  lectin-based  glycoprotein  capture  and  targeted 
immuno-based  assays.  The  carbohydrate  binding  specificities  of  different  lectins  offers  a 
biological  affinity  approach  that  complements  existing  mass  spectrometer  capabilities 
and  retains  automated  throughput  options.  Specific  examples  using  serum  samples  from 
prostate  cancer  and  hepatocellular  carcinoma  subjects  are  provided,  along  with  suggested 
experimental  strategies  for  integration  of  lectin  based  methods  into  clinical  fluid 
expression  profiling  strategies.  Our  example  workflow  incorporates  the  necessity  of 
early  validation  in  biomarker  discovery  using  an  immuno-based  targeted  analytical 
approach  that  integrates  well  with  upstream  discovery  technologies. 


Serum  protein  expression  profiling  using  time-of-flight  mass  spectrometry 
approaches  to  identify  biomarkers  of  disease  has  reached  a  nexus  of  technology 
development,  clinical  study  design,  and  bioinfonnatics.  Following  a  period  of  promising 
initial  work  using  matrix-assisted  laser  desorption  ionization  (MALDI)  and  surface- 
enhanced  laser  assisted  ionization  (SELDI)  mass  spectrometry  (1-4),  new  strategies  to 
increase  the  utility  of  this  approach  for  clinical  biomarker  assay  development  are  needed 
(5,6).  Largely  employing  simple  chemical  affinity  beads  or  surfaces  to  decrease  the 
sample  complexities,  these  methods  offer  an  automated,  sensitive  technique  that 
consumes  small  amounts  of  clinical  sample  with  relatively  high  throughput  (2,  7-9). 
Concerns  with  the  approach  have  included  lack  of  analytical  reproducibility,  diminished 
robustness  of  discovered  biomarkers  during  validation,  lack  of  protein  identification,  and 
a  fear  that  the  dynamic  range  of  prevalent  proteins  in  serum  or  plasma  prohibits 
identification  of  proteins  associated  with  disease  (10-12).  Some  of  these  concerns  have 
subsequently  been  attributed  to  study  design  bias,  chance,  an  overgeneralization  of  results 
and  sample  processing  issues  (13,14).  On  the  other  hand,  when  careful  study  design  and 
sample  handling  is  combined  with  carefully  controlled  instrument  calibration,  automated 
sample  preparation,  and  supervised  bioinformatic  data  analysis,  serum  expression 
profiling  can  be  reproducible  and  portable  across  multiple  laboratories  (5,8,  15-17). 
However,  the  remaining  issues  of  protein  dynamic  range  and  complexity  continue  to 
plague  these  and  indeed  all  proteomic  approaches. 

At  this  point  in  time,  future  expression  profiling  studies  using  clinical  samples 
will  require  a  careful  balance  of  controlling  for  known  problems  while  at  the  same  time 
exploiting  the  rapid  advances  in  robotic  fractionation  and  mass  spectrometry 


technologies.  In  this  overview  of  emerging  serum  proteomic  expression  profiling 
strategies,  we  propose  that  use  of  lectin  targeting  of  serum  glycoprotein  isoforms 
provides  the  desired  experimental  balance  that  minimizes  known  study  design  concerns 
while  retaining  the  established  strengths  of  high-throughput  approaches.  The 
carbohydrate  binding  specificities  of  different  lectins  offers  a  biological  affinity  approach 
that  complements  existing  chemical  affinity  methods  and  retains  automated  throughput 
options  with  current  mass  spectrometer  capabilities.  Specific  examples  using  serum 
samples  from  prostate  cancer  and  hepatocellular  carcinoma  subjects  are  provided,  along 
with  suggested  experimental  strategies  for  integration  of  lectin  based  methods  into 
clinical  fluid  expression  profiling  strategies. 

Altered  Glycoproteins  and  Cancer 

It  has  long  been  known  that  cellular  glycosylation  profiles  change  significantly 
during  oncogenesis  (18-20),  and  hence  the  continued  search  for  tumor-secreted 
glycoproteins  that  can  serve  as  biomarkers  for  diagnostics  and/or  tumor  markers  of  the 
biological  changes  associated  with  the  altered  glycosylation  patterns  associated  with 
development  of  cancer  (21-24).  One  well  characterized  example  is  that  of  increased 
activity  of  N-Acetylglucosaminyltransferase  V  (GnT-V),  a  key  enzyme  in  the  formation 
of  branching  asparagine-linked  oligosaccharides  that  has  been  linked  to  tumor  invasion 
and  metastasis  in  multiple  cancers  (25-28).  An  increase  of  fil-6  branched 
oligosaccharides  within  metastatic  lymph  nodes  of  breast  carcinomas  has  been  reported, 
and  the  presence  of  the  branched  oligosaccharides  was  associated  with  poor  prognosis 
(28).  The  role  of  sialylated  oligosaccharides  was  evaluated  within  primary  breast  tumors, 


and  it  was  found  that  an  overall  reduction  in  the  diversity  of  sialylated  and  neutral 
oligosaccharides  occurred  with  disease  progression  (29,  30).  It  is  also  clear  that 
fucosylated  glycoproteins  are  elevated  in  individuals  with  liver,  colorectal  and  prostate 
cancers  (18,  31).  As  discussed  in  a  later  section,  we  have  also  shown  that  a 
comprehensive  characterization  of  differentially  fucosylated  serum  glycoproteins  can  be 
used  to  readily  distinguish  hepatitis  B  induced  liver  cancer  subjects  from  healthy  control 
subjects  in  blinded  assays  (32,  33). 

Prostate  specific  antigen  (PSA)  is  one  of  the  best  characterized  examples  of  a 
secreted  glycoprotein  used  in  cancer  diagnostics,  and  multiple  glycoforms  of  PSA  have 
been  described  (31,  34-36).  PSA  is  a  28,400  Da  glycoprotein  with  one  defined  N-linked 
oligosaccharide  side  chain  at  Asn45,  and  is  a  serine  protease  in  the  kallikrein  family 
(kallikrein  3)  (37).  PSA  is  secreted  primarily  by  prostatic  epithelial  cells  into  the  seminal 
plasma,  where  it  can  reach  concentrations  of  0.5-3  mg/ml  (38).  The  glycoforms  of  PSA 
from  seminal  plasma  have  been  shown  to  differ  from  the  glycosylation  of  PSA  secreted 
by  the  prostate  metastatic  tumor  cell  line  LNCaP  (31,  34,  35).  A  thorough  comparison  of 
PSA  glycofonns  from  seminal  plasma  and  serum  from  healthy  control  and  prostate 
cancer  patients  has  been  described  (36),  however,  larger  scale  studies  with  more  clinical 
emphasis  on  study  design  and  sample  numbers  are  still  needed.  Additionally,  evaluation 
of  any  differences  in  the  glycoforms  of  PSA  bound  by  carrier  proteins  in  serum 
(primarily  a-l-antichymotrypsin)  versus  that  of  the  free  circulating  PSA  remains  to  be 
determined.  Alpha-fetoprotein  (AFP)  is  another  well  characterized  serum  glycoprotein 
used  as  a  surrogate  marker  of  the  presence  of  liver  cancers,  and  multiple  glycoforms  of 
AFP  have  been  identified  (33,  39-41).  Specific  targeting  of  these  PSA  and  AFP 


glycoforms  as  the  diagnosis  for  disease  state,  or  similar  characterization  of  other  known 
serum  components  like  prostate  specific  membrane  antigen  (PSMA),  represent  a  targeted 
proteomic  strategy  amenable  to  quantitative  mass  spectrometry  strategies  that  employ 
isotope-tagging  or  mass-shift  labeling  techniques  (42,  43). 

Glycoproteomics  and  Lectins 

The  term  glycoproteomics  has  been  used  to  describe  this  emerging  branch  of 
proteomics  that  focuses  on  characterizing  the  protein  and  carbohydrate  constituents  of 
glycoproteins.  Structural  elucidation  of  mammalian  glycoproteins  has  long  relied  upon 
the  use  of  lectins,  a  class  of  proteins  found  in  plants,  bacteria,  fungi  and  animals  that  are 
known  to  bind  specific  oligosaccharide  moieties  (44-47).  Unlike  antigen-antibody 
binding  affinities,  the  affinity  constants  (Ka)  for  the  binding  of  monosaccharides  and 
oligosaccharides  to  most  lectins  are  in  the  low  micromolar  range,  but  can  be  millimolar 
(47,48).  For  affinity  capture  purposes,  it  is  the  multivalent  nature  of  both  the 
oligosaccharides  and  the  lectins  themselves  that  make  these  interactions  useful  for 
chromatography  separations  (46,47).  The  most  common  approaches  for  use  of  lectins  to 
capture  serum  glycoproteins  has  been  to  digest  the  serum  with  trypsin,  isolate  the 
glycopeptides  with  one  or  more  lectins  linked  to  a  support  resin,  elute  and  deglycosylate 
the  bound  peptides  with  Protein  N-Glycanase  F  (PNGaseF).  The  sequence  and  protein 
identities  of  these  peptides  are  detennined  by  tandem  mass  spectrometry  (42,  49-53)  or 
Fourier  transform  ion-cyclotron  resonance  (FT-ICR)  mass  spectrometry  (54-56).  These 
approaches  have  been  proven  to  be  useful  for  identifying  low  concentration  serum 
glycoproteins,  but  there  is  very  low  sample  throughput.  Strategies  that  probe  different 


lectins  bound  on  multiple  array  platforms  (57-59)  are  emerging  as  one  approach  to 
overcome  the  throughput  issue.  These  assays,  along  with  different  nanotechnology 
improvements  (60,61),  will  continue  to  evolve  toward  potential  clinical  assays. 

In  this  overview,  we  will  describe  our  approach  to  the  application  of  different 
lectins  to  enrich  for  serum  glycoforms  found  in  sera  from  prostate  cancer  and 
hepatocellular  carcinoma  subjects.  For  the  lectin  affinity  approach,  we  believe  this  offers 
several  distinct  biochemical  advantages  toward  protein  fractionation.  In  addition,  there 
are  many  lectin  types  available  commercially  in  pure,  resin-bound,  biotinylated  or 
fluorescent-labeled  fonns,  and  they  are  generally  inexpensive.  These  properties  lend 
themselves  to  automation,  in  particular  bead-based  automated  fractionation  strategies  as 
front-ends  to  mass  spectrometry  (62).  Many  major  serum  proteins,  in  particular  albumin, 
are  not  glycosylated  and  are  therefore  not  bound  efficiently  by  lectins,  offering  a 
concomitant  decrease  in  dynamic  range  of  serum  protein  targets.  Therefore,  the 
biological  affinity  of  lectins  offers  multiple  uses  and  strategies  to  maximize  the 
information  gained  from  precious  clinical  samples.  Issues  related  to  unifonnity  in 
preparation  of  different  lectins,  their  known  weak  binding  constants  and  reproducibility 
of  binding  across  large  sample  numbers  are  clear  hurdles  that  will  need  to  be  addressed  as 
higher  throughput  automation  strategies  are  pursued.  In  the  following  paragraphs,  some 
of  these  properties  are  demonstrated  for  isolation  of  serum  glycoproteins,  and  strategies 
to  automate  these  processes  while  maximizing  the  amount  of  information  gathered  from 
the  analysis  will  be  discussed. 


Fractionation  and  Identification  of  Lectin  Captured  Serum  Glycoproteins 


In  order  to  capitalize  on  the  many  glycoproteins  found  in  serum  and  the  known 
changes  in  glycosylation  associated  with  cancers  we  have  devised  a  modular  strategy  for 
specific  targeting  of  glycoproteins  in  sera  for  characterization  as  potential  biomarkers.  As 
shown  in  the  schematic  in  figure  1 ,  the  up-front  differential  lectin  affinity  capture  module 
integrates  well  with  a  variety  of  proteomic  tools  and  resources  currently  available  to 
enrich,  identify  and  characterize  serum  proteins.  As  described,  our  approach  of  stratifying 
whole  glycoproteins  is  in  contrast  to  the  more  typical  paradigm  of  isolating  glycopeptides 
prior  to  tandem  mass  spectrometry  identification  (Figure  1,  left  branch),  we  are 
attempting  to  retain  the  larger  multi-protein  complexes  in  serum  via  lectin  capture  prior 
to  trypsin  digestion  and  tandem  mass  spectrometry  (Figure  1,  right  branch).  Thus,  the 
information  gained  will  be  complimentary  to  a  glycopeptide-based  analysis.  The 
modularity  of  the  lectin  affinity  step  is  that  it  is  not  dependent  on  any  one  method  or  mass 
spectrometry  platform,  and  is  inherently  extensible  and  adaptable  to  new  technology 
improvements  at  any  point  in  the  process.  It  is  also  amenable  to  automation  with  bead- 
based  or  chip-based  robotics,  which  is  important  in  large  scale  assessment  of  sera  from 
well  defined  clinical  sets. 

Initially  we  assessed  two  parameters,  which  lectins  provided  the  most  differential 
capture,  and  whether  serum  depletion  prior  to  lectin  capture  improved  detection  of 
glycoproteins.  A  panel  of  six  lectins  was  employed  in  the  capture  step.  These  included 
two  lectins  known  to  bind  fucose  residues  linked  to  N-acetylglucosamine,  AAL  (Aleuria 
aurantia )  and  AAA,  (Anguilla  anguilla ),  a  lectin  targeting  a2,6  linked  sialic  acid 
residues,  SNA1  (Sambucus  nigra),  and  HPA  (Helix  pomatia  agglutinin)  which  binds  N- 
acetylgalactosamine  residues.  Two  broad  coverage  lectins,  wheat  germ  agglutinin  (WGA, 


binds  tenninal  N-acetylglucosamines)  and  Concanavalin  A  (ConA,  binds  terminal 
mannoses  and  glucoses)  were  also  included.  The  lectin  incubations  were  initially  done 
with  pooled  sera  from  matched  subjects  with  benign  hyperplasia  (BPH)  and  prostate 
cancer  (PCa)  (n  =  5).  Sera  were  incubated  with  each  lectin  bound  to  agarose  (E-Y 
Laboratories  or  Vector  Laboratories)  for  16  hrs,  eluted  with  the  appropriate  target 
monsaccharide  and  separated  by  SDS-PAGE  as  we  previously  described  (32,63). 
Representative  separations  of  the  eluted  proteins  from  each  lectin  are  shown  in  figure  2. 
It  is  clear  from  this  simple  gel-based  analysis  that  use  of  different  lectins  confers  selective 
enrichment  of  serum  glycoproteins,  and  some  overlap  and  redundancy  in  the  captured 
proteins.  Each  of  these  findings  is  consistent  with  known  biology  of  glycoproteins. 

Gel  slices  of  multiple  protein  bands  from  SDS  polyacrylamide  gels,  like  that 
shown  in  figure  2,  were  prepared  for  analysis  by  LC-MS/MS  or  MALDI-TOF/TOF.  A 
partial  list  of  the  proteins  identified  by  this  approach  that  are  differentially  expressed 
between  BPH  and  PCa  sera  are  presented  in  table  1,  listed  per  lectin.  Not  surprisingly, 
these  are  primarily  proteins  found  in  high  concentrations  in  serum,  so  that  it  is 
specifically  the  variant  in  glycosylation  that  is  associated  with  disease.  Such  results  of 
course  require  orthogonal  confirmation  and  further  validation  on  non-pooled  sample  sets. 
A  fucosylated  version  of  AFP  is  well  described  in  sera  associated  with  liver  cancers 
(39,40).  Identification  of  an  AFP  variant  in  prostate  cancer  sera  is  a  novel  finding,  while 
identification  of  the  haptoglobin  and  ApoA-I  variants  are  also  consistent  with  previous 
expression  profiling  studies  of  cancer  sera  (32,  64-67).  These  glycoprotein  isoforms  may 
represent  new  potential  biomarkers  for  detection  of  disease,  monitoring  cancer  treatment, 
or  surveillance  for  recurrence  post-therapy.  Regardless  of  the  eventual  disposition  of 


these  examples  as  true  biomarkers,  this  exercise  demonstrates  the  ability  to  achieve 
disease-group  specific  differential  lectin  capture. 

We  next  examined  the  effects  of  two  serum/plasma  protein  depletion  strategies  on 
the  types  of  serum  glycoproteins  captured  by  lectins.  As  a  comparison  we  show  a 
representative  display  of  the  glycoproteins  that  bind  SNA1  lectin  versus  serum  proteins 
that  do  not  bind  SNA1  lectin  (Figure  3  lanes  1  and  2).  For  the  six  lectins  tested  under  the 
conditions  utilized,  the  lack  of  albumin  binding  to  the  lectins  has  been  consistent.  This 
lack  of  albumin  binding  to  lectins  is  a  clear  advantage  to  their  use  as  biological  affinity 
reagents  and  for  up-front  fractionation  strategies.  Indeed,  the  lectin-capture  step  may 
simultaneously  concentrate  classes  of  glycoproteins  and  eliminate  the  major  blood 
proteins.  To  evaluate  depletion  of  major  blood  proteins  prior  to  lectin  incubation,  two  kits 
were  used  with  a  PCa  cancer  sera  pool:  a  ProteoPrep  20  Plasma  Immunodepletion  Kit 
(Sigma-Aldrich),  which  uses  antibody  capture  to  remove  20  of  the  most  prevalent  blood 
proteins,  and  a  Montage  Albumin  Deplete  Kit  (Millipore),  which  uses  a  blue-dye  affinity 
resin.  Under  the  conditions  used,  both  kits  were  very  effective  at  removing  major  serum 
proteins  (Figure  3,  lanes  8  and  9).  For  comparison,  an  aliquot  of  unfractionated  serum  is 
shown  (Figure  3,  lane  10).  The  depleted  protein  fraction  was  then  incubated  with  SNA1 
lectin,  and  the  bound  (Figure  3,  lanes  4  and  6)  and  unbound  proteins  (Figure  3,  lanes  5 
and  7)  separated  by  SDS-PAGE.  Comparison  of  the  bound  protein  profiles  from  the 
depeleted  fractions  with  that  of  the  lectin  alone  indicate  a  highly  similar  pattern  of  bound 
proteins,  independent  of  depletion.  Similar  results  were  obtained  with  the  AAL,  ConA 
and  WGA  lectins  (not  shown).  Much  additional  quantitative  evaluation  of  these  fractions 
remains  to  be  done,  but  these  results  demonstrate  that  lectins  can  serve  as  excellent  initial 


fractionation  resins  for  enrichment  of  serum  glycoproteins,  both  with  and  without  pre¬ 
depletion. 

Fucosylation  changes  in  serum  proteins  associated  with  HCC  carcinogenesis 

The  best  documented  change  that  occurs  in  glycosylation  during  the  development 
of  hepatocellular  carcinoma  (HCC)  is  an  increase  in  the  level  of  core  a- 1,6  linked 
fucosylation  of  AFP  (39,40).  In  HCC  and  in  testicular  cancer,  the  glycosylation  of  AFP 
shifts  from  a  simple  biantennary  glycan  to  an  a- 1,6  linked  core  fucosylated  biantennary 
glycan.  These  changes  have  been  observed  by  both  direct  glycan  sequencing  of  AFP  and 
by  increased  reactivity  of  AFP  with  a  variety  of  lectins  that  preferentially  bind  to  fucose 
containing  glycan  (41).  The  glycoform  of  AFP  that  reacts  preferentially  with  the  lectin 
lens  culinaris  (LCH)  is  referred  to  as  AFP-L3,  and  it  has  been  characterized  as  being  a 
more  specific  marker  of  HCC  than  total  AFP  protein  levels  (68-70).  Because  of  this, 
AFP-L3  was  approved  by  the  U.S  Food  and  Drug  Administration  in  2005  to  be  the  only 
diagnostic  assay  available  for  HCC. 

Recent  glycan  analysis  of  whole  serum  (71),  or  serum  that  has  been  depleted  of 
immunoglobulin  (32,33),  have  reported  that  increases  in  the  levels  of  core  fucosylation  of 
many  serum  glycoproteins  was  observed  with  the  development  of  HCC.  To  identity  those 
proteins  that  had  increased  fucosylation,  the  fucosylated  glycoproteins  associated  with 
sera  from  either  pooled  normal  or  pooled  HCC  positive  individuals  were  extracted  using 
fucose  specific  lectins  (LCH,  AAA  and  AAL)  and  the  proteome  analyzed  by  either  two 
dimensional  gel  electrophoresis  (2DE)  or  by  a  simple  LC  MS/MS  based  methodology 
designed  to  identify  fucosylated  peptides  (described  in  33).  A  representative  list  of  the 


fucosylated  proteins  identified  are  provided  in  table  2.  The  levels  of  fucosylated 
glycoforms  (Fc)  of  serum  glycoproteins  like  Fc-a-l-acid  glycoprotein,  Fc-ceruloplasmin, 
Fc-alpha-2-macroglobulin,  Fc-hemopexin,  Fc-Apo-D,  Fc-HBsAg,  and  Fc-Kininogen 
were  increased  in  patients  with  HCC,  while  the  levels  of  Fc-haptoglobin  was  decreased  in 
the  those  patients  (33).  A  similar  methodology  was  utilized  by  us  in  an  animal  model  of 
HBV  induced  HCC  and  identified  a  potential  biomarker  termed  GP73  that  has  been 
shown  to  be  2-3  time  more  sensitive  than  AFP  (32,72). 

We  have  examined  the  level  of  Fc-GP73  and  Fc-hemopexin  in  a  small  patient 
cohort  containing  a  total  of  80  patients  with  varying  degrees  of  liver  disease  (n=20  each 
healthy  subjects,  HBV  carriers,  HBV  cirrhosis,  HCC).  Analysis  of  these  samples  was 
performed  for  total  GP73  level,  for  the  level  of  fucosylated  GP-73  (Fc-GP73),  and  for  the 
level  of  fucosylated  hemopexin  (Fc-hemopexin).  Total  GP73  was  analyzed  by 
immunoblot  using  whole  serum,  and  fucosylated  species  were  analyzed  by  LCH  lectin 
extraction  of  5  pi  of  serum  followed  by  immunoblotting  of  the  fucosylated  fraction  as 
previously  described  (32,33).  A  representative  blot  from  a  subset  of  these  samples,  and 
the  effects  on  the  sensitivity,  specificity,  and  positive  predicative  values  for  these  three 
markers  are  presented  in  figure  X.  Use  of  total  GP73  levels  alone  had  a  sensitivity  of 
65%  and  a  specificity  of  90%,  which  was  very  similar  to  our  larger  blinded  study  using 
this  marker  (72).  If  the  fucosylated  glycoforms,  Fc-GP73  and  Fc-hemopexin,  are 
evaluated  alone,  this  improves  the  sensitivities  to  >  90%  and  specificities  of  100%.  These 
results  are  only  from  a  small  sample  set,  and  larger  studies  are  ongoing,  but  it 
demonstrates  that  specific  fucosylated  isofonns  of  serum  glycoproteins  represent  a  rich 
pool  of  new  biomarker  targets  for  HCC,  and  likely  many  other  cancers. 


Automatation  of  lectin-capture  for  mass  spectrometry-based  protein  expression  profiling. 

One  of  our  goals  in  developing  the  lectin  capture  strategies  was  to  incorporate  this 
into  a  relatively  high-throughput  serum  expression  profiling  platform.  This  would 
facilitate  analysis  of  the  large  number  of  clinical  samples  necessary  to  accommodate  the 
many  disease  variables  associated  with  cancers,  as  well  as  equally  important 
epidemiological,  study  design  and  biostatistical  issues.  One  approach  is  summarized  in 
figure  1 .  This  involves  capture  of  the  intact  glycoproteins  by  lectins,  followed  by  trypsin 
digestion  to  generate  peptides  and  final  affinity  separation  of  peptides  prior  to  MALDI- 
TOF  and/or  tandem  mass  spectrometry  analyses.  All  of  these  steps  can  be  fully 
automated  using  bead-based  supports  coupled  to  liquid  sample  handling  robotics. 
Commercially  available  configurations  to  accomplish  this  are  already  available,  typified 
by  the  Bruker  Daltonics  ClinProt®  system  (9,62),  which  we  have  used  for  our  analyses. 
Shown  in  figure  5  is  an  example  of  this  approach  as  applied  to  pooled  sera  from  biopsy 
proven  early  prostate  cancer  and  benign  disease,  with  PSA  values  for  both  sets  ranging 
from  2-4  ng/ml.  In  the  left  panel,  a  gel  image  of  ConA  bound  serum  glycoproteins  before 
and  after  trypsin  digestion  is  shown.  An  aliquot  of  the  undigested  and  digested  proteins 
were  incubated  with  IMAC-Cu  magnetic  beads,  processed  robotically,  and  eluted 
peptides/proteins  spotted  on  a  steel  plate  for  MALDI-TOF  analysis.  In  the  right  panel  we 
show  a  comparison  of  the  MALDI-TOF  spectra  of  the  two  fractions,  with  or  without 
trypsin  digestion.  The  increased  detection  of  m/z  peaks  in  the  trypsin  digested  samples  is 
consistent  with  assessment  of  high  molecular  weight  proteins  that  are  not  well  resolved  as 
whole  proteins.  In  addition,  we  can  observe  specific  differences  in  the  spectra  between 


PCa  and  BPH.  On  a  larger  scale,  we  propose  that  this  approach  can  be  used  to  generate 
spectra  for  expression  profiling  purposes  and  bioinformatics  analyses,  and  also  serve  as 
targets  for  further  tandem  MS  analysis.  In  our  specific  example,  this  would  be  done  using 
automated  MS/MS  using  the  MALDI-TOF/TOF.  However,  we  have  also  used  this 
approach  successfully  when  coupled  with  differential  labeling  (iTRAQ)  and  analysis 
using  ESI-MS/MS.  In  point  of  fact,  the  lectin  affinity-capture  of  whole  proteins  should 
be  amenable  to  most  mass  spectrometry  based  analysis. 

A  Pre- Validation  Strategy  for  Quantitative  Assessment  of  Protein  Biomarker  Isoforms. 

We  have  demonstrated  that  lectin-based  fractionation  strategies  can  be  an 
excellent  initial  front-end  step  for  serum  glycoprotein  isolation  and  a  powerful  approach 
toward  biomarker  discovery.  However,  establishment  of  intial  discovery  approaches 
should  be  complimented  with  compatible  analytical  confirmation  and  pre-validation 
strategies.  For  this  purpose  we  have  recommended  targeted  affinity  methods  sometimes 
referred  to  as  immuno-MS  (73,  74)  The  targeted  candidate  biomarker  may  be  either 
specific  isofonns  of  whole  proteins  or  subunits  and  fragments  of  larger  proteins.  We 
recently  demonstrated  the  power  of  immuno-MS  in  the  characterization  of  an  isoform  of 
ApoA-II,  the  over-expression  of  which  was  confirmed  to  be  specific  to  benign  prostate 
hyperplasia  (BPH)  and/or  PCa  as  compared  to  healthy  controls  (73).  Interestingly, 
although  ApoA-II  is  known  to  exist  as  an  8.7  kD  protein  in  serum,  we  found  that  a  unique 
isoform  with  a  mass  of  8.9  kD  was  associated  with  disease  state.  The  individual  serum 
isofonns  of  ApoA-II  can  be  revealed  by  immuno-capture  and  mass  spectrometry  analysis 
whereas  standard  immuno-assay  techniques  cannot  generally  discriminate  isoforms.  In 


figure  6  we  show  the  specific  capture  of  this  unique  isoform  of  serum  ApoA-II  from 
disease  specimens  only.  However,  even  though  we  could  easily  capture  purified  ApoA- 
II,  we  were  unable  to  capture  significant  amounts  of  wild-type  ApoA-II  from  serum. 
Thus,  combining  immuno-capture  with  mass  spectrometry  allows  for  analysis  of  specific 
and  altered  forms  of  the  target  protein  and  is  a  useful  intermediate  step  toward  the 
maturation  of  a  biomarker  from  discovery  to  utility. 

Clearly,  such  targeted  proteomic  strategies  that  utilize  high-affinity  or  baiting 
strategies  can  be  used  to  selectively  enrich  lower  abundance  proteins.  Baiting  strategies 
can  also  be  used  to  identify  biomarker  function  and  to  associate  the  biomarker  with  a 
disease  pathway  through  identification  of  other  proteins  with  which  it  interacts.  This 
technology  may  also  bridge  the  gap  between  hypothesis-  and  data-driven  biomarker 
discovery  by  allowing  functional  baiting  of  entire  classes  of  biomarkers  that  are 
implicated  in  disease  and  disease  progression.  An  affinity  pre-enrichment  of  target 
proteins  has  two  advantages:  low  abundance  proteins  can  be  seen  and  quantified,  and  the 
identity  of  the  proteins  (their  primary  sequence)  is  established,  allowing  subsequent 
isofonn  characterization.  We  have  begun  using  a  recently  developed  technology,  isotope- 
differentiated  binding  energy  shift  tags  (IDBEST™,  Target  Discovery,  Inc.)(43,75),  to 
quantitate  specific  serum  isoforms,  including  glycoforms.  IDBEST™  uses  specific  tags 
which  exploit  the  natural  phenomenon  of  mass  defect.  The  mass  defect  is  related  to  the 
nuclear  binding  energy  released  upon  formation  and  stabilization  of  the  nucleus  of  a 
given  element.  Bio-molecules  have  a  very  negligible  mass  shift.  A  maximum  mass  defect 
value  of  ~0.1  amu  is  obtained  for  elements  with  atomic  numbers  between  35(Br)  and 
63(Eu).  These  isotope-differentiated  binding-energy-shift  tags  shift  the  peaks  of  all  the 


tagged  species  by  about  0.1  Da,  allowing  software  to  discriminate  tagged  from  untagged 
species  directly  in  the  mass  spectrum  and  thus  eliminating  the  need  for  affinity  cleanup  of 
the  tagged  samples. 

As  an  example  of  the  application  of  this  technique  to  biomarker  assessment  we 
labeled  a  target  and  reference  samples  and  then  followed  with  immuno-capture  of  PSA. 
The  specifically  isolated  protein  complexes  are  trypsin  digested  and  subjected  to 
MALDI-TOF  analysis.  In  figure  7  we  show  an  example  of  the  specific  quantitation  of  a 
PSA  peptide  using  this  approach.  We  are  currently  evaluating  the  quantization  accuracy 
of  this  approach  by  parallel  sampling  against  clinical  PSA  information.  We  anticipate 
using  the  IDBEST™  technology  for  the  quantitation  of  specific  isoforms  of  known 
biomarkers  in  prostate  cancer,  such  as  PSA,  as  well  isoforms  of  apolipo-proteins. 
Specific  peptides  that  encompass  structural  changes  (isoforms)  of  whole  proteins  can  be 
targeted  by  modification  of  the  isolation  process.  Thus,  for  glycoforms  we  would  pre¬ 
treat  with  glycosidase  to  remove  the  glycan  and  identify  the  peptide  via  resulting  mass 
shift.  The  ability  to  carry  early  discovery  through  confirmation  and  rapid  pre-validation, 
prior  to  the  expensive  and  time-consuming  process  of  developing  isofonn  or 
glycospecific  antibodies,  should  prove  advantageous  to  accelerating  biomarker  discovery. 

Summary 

The  continued  improvements  in  proteomic  mass  spectrometry  technologies, 
coupled  with  the  human  and  other  genome  databases,  has  allowed  unprecedented 
opportunities  for  biomarker  protein  discovery  and  analysis  of  complex  proteomes  like 
serum.  An  underlying  issue  will  always  be  the  quality  of  the  starting  material,  as  this  will 


ultimately  dictate  the  quality  of  the  proteomic  data  and  utility  of  this  for  clinical 
purposes.  In  this  regard,  sample  collection,  storage  and  quality  issues,  epidemiological 
input  and  study  design  biases  will  always  influence  clinical  proteomic  studies.  Building 
from  what  was  learned  in  the  first  wave  of  serum  protein  expression  profiling  studies, 
strategies  striking  a  balance  of  sample  throughput  with  improved  depth  of  serum  protein 
capture  and  protein  identification  needs  to  continue  to  evolve.  Accommodating  these 
concerns  into  proteomic  analysis  design  is  facilitated  by  use  of  the  lectins  as  front-end 
fractionation  and  enrichment  tools.  We  have  only  described  the  use  of  six  lectins,  and 
there  are  dozens  of  other  individual  lectin  types  available  commercially  that  remain  to  be 
empirically  assessed,  and  hundreds  more  described  in  the  literature  (21,45).  Additionally, 
serial  affinity  capture  strategies  in  which  different  lectins  are  used  in  tandem  will  increase 
the  fractionation  capabilities  beyond  the  discussed  examples.  The  use  of  a  biological 
affinity  approach  targeting  known  glycosylation  changes  associated  with  cancer  is  an 
additional  benefit.  Coupling  the  glycoprotein  characterizations  to  some  type  of 
simultaneous  or  complementary  glycan  analysis  of  the  same  samples  will  further  extend 
the  utility  of  this  approach.  Identification  of  altered  carbohydrate  content,  whether  it  is 
sialic  acid  or  fucose  residue  differences,  automatically  implicates  the  corresponding 
glycosyltransferases  or  (glycosidases)  as  potential  participants  in  the  oncogenesis  of  a 
particular  cancer  subtype.  These  enzyme  classes  can  readily  be  assessed  by  other 
methods  than  proteomics,  including  gene  or  tissue  microarrays,  and  basic  biochemical 
enzymatic  assays. 
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Figure  Legends 

Figure  1.  Experimental  flow  chart  combining  lectin  capture  strategies  with  mass 
spectrometry  approaches. 

Figure  2.  Differential  Lectin  Capture  of  Serum  Glycoproteins.  Pooled  serum 
samples  (30  ul)  from  benign  prostatic  hyperplasia  and  prostate  cancer  patients  were 
incubated  with  the  indicated  lectin-agarose  beads  (50  ul  bead  volume)  for  16  hrs  (32,  63), 
and  eluted  with  the  respective  monosaccharide.  Bound  proteins  were  separated  on  8-16% 
Criterion  SDS-polyacrylamide  gels  (Bio-Rad  Laboratories,  Inc.,  Hercules,  CA)  and  were 
visualized  by  silver  staining. 

Figure  3.  Comparison  of  Serum  Protein  Depletion  and  Lectin  Capture  Strategies. 

A  pooled  prostate  cancer  serum  pool  (50  ul)  was  incubated  with  SNA1  lectin  alone,  or 
pre-fractionated  with  a  Montage  albumin  depletion  column  (Millipore,  Billerica,  MA)  or 
a  ProteoPrep  20  column  (Sigma- Aldrich,  St.  Louis,  MO)  prior  to  incubation  with  the 
SNA1  lectin.  Eluted  proteins  from  each  column  were  separated  on  8-16%  SDS-gels  and 
stained  with  Coomassie  Blue  as  follows:  1.  protein  fraction  not  bound  to  SNA1;  2. 
protein  fraction  eluted  from  SNA  1 -agarose  lectin  beads;  3.  serum  protein  fraction  not 
bound  to  the  Sigma  ProteoPrep20  column;  4.  ProteoPrep20-depleted  proteins 
bound/eluted  to  SNA1;  5.  ProteoPrep20  depleted  serum  protein  fraction  not  bound  to 
SNA1;  6.  Montage-depleted  serum  protein  fraction  bound/eluted  to  SNA1;  7.  Montage- 
depleted  serum  protein  fraction  not  bound  to  SNA1 ;  8.  Serum  proteins  bound/eluted  to 
ProteoPrep  20  column;  9.  Serum  proteins  bound  to  Montage  column;  10.  untreated 


serum. 


Figure  4.  The  level  of  Fc-GP73  and  Fc-hemopexin  in  the  serum  of  patients  with 


varying  HCC  disease  states.  A  representative  example  of  Fc-GP73  levels  (A.)  and  Fc- 
hemopexin  (B.)  in  normal  (3  patients),  cirrhotic  (4  patients)  and  HCC+  (7  patients)  as 
determined  by  immunoblot  following  isolation  of  fucosylated  serum  proteins.  (C.)  The 
sensitivity,  specificity  and  positive  predictive  value  of  total  GP73,  Fc-GP73  and  Fc- 
hemopexin  in  the  patients  shown  in  table  2. 

Figure  5.  Concanavalin  A  bound  serum  glycoproteins  digested  with  trypsin  prior  to 
MALDI-TOF.  Pooled  serum  from  benign  prostatic  hyperplasia  and  prostate  cancer  sera 
were  bound  to  concanavalin  A  lectin,  then  eluted.  A  portion  of  each  eluate  was  separated 
on  an  8-16%  SDS-gel  (A.),  either  intact  (lanes  1,2)  or  digested  overnight  with  trypsin 
(lanes  3,4).  (B.)  The  same  eluates,  trypsin  digested  or  intact,  were  incubated  with  IMAC- 
Cu  magnetic  beads,  eluted  and  applied  1 : 1  with  CHCA  matrix  to  a  steel  plate  for  analysis 
on  a  Bruker  Daltonics  Ultraflex  MALDI-TOF  in  linear  mode. 

Figure  6.  Expression  profile  of  ApoA-II  isoforms  in  control  and  prostate  cancer 
sera.  Sera  were  processed  on  IMAC-Cu  ProteinChips  for  SELDI-TOF  MS  as  previously 
described  (73).  The  mass  region  from  8000-9500  m/z  is  shown  for  purified  ApoA-II  (top 
spectra),  sera  from  a  prostate  cancer  patient  (middle  spectra)  and  sera  from  a  healthy 
nonnal  patient  (bottom  spectra). 


Figure  7.  Quantitative  Immuno-MS  assay  for  Prostate  Specific  Antigen.  Purified 
prostate  specific  antigen  was  labeled  with  a  lysine  reactive  ID-BEST™  mass  defect 
reagent  (Targeted  Discovery,  Inc.,  Palo  Alto,  CA),  digested  with  trypsin,  and  analyzed  on 
a  Bruker  Daltonics®  UltraFlex™  MALDI-TOF/TOF.  The  raw  spectra  (top  panel)  and 
mass  defect  spectra  (bottom  panel)  around  the  region  of  one  of  the  expected  PSA 
peptides  is  shown. 


Table  1.  Serum  proteins  identified  after  differential  lectin  capture 


AAL 

Albumin 

a-2-macroglobulin 
Immunoglobulin  a  Chain  C 
Serotransferrin  precursor 
AFP  -  alpha  fetoprotein 
Haptoglobin 

AAA 

Albumin 

ApoA-1 

AFP  -  alpha  fetoprotein 
Myotubularin  protein 
Transthyretin  precursor 


ConA 

Albumin 

a-2-macroglobulin 

ApoA-1 

Serum  amyloid  P  chain 
Serotransferrin  precursor 

SNA1 

a- 1 -antitrypsin 
ApoA-1 

Complement  factor  B  precursor 
Myotubularin  related  protein 
Transferrin 


Table  2.  Proteins  identified  as  altered  in  patients  with 

Identified  Fucosylated  Protein 

Fc-GP-73 
Fc-Hemopexin 
Fc-HBsAg 
Fc-AFP 

Fc-alpha-acid-glycoprotein 
Fc-alpha-1  -antichymotrypsin 
Fc-alpha-1  -antitrypsin 
Fc-Serotransferrin 
Fc-Ceruloplasmin 
Fc-alpha-2-macroglobulin 
Fc-alpha-2-HS-glycoprotein  (Fetuin  A) 
Fc-Haptoglobin 

Fc-Fibrinogen  gamma  chain  precursor 
Fc-Complement  factor  B 
Fc-IgG 
Fc-lgA 
Fc-APO-D 
Fc-IgM 
Fc-Kininogen 

Fc-Histidine  rch  glycoprotein 
Fc-Complement  CIs  component 
Fc-alpha-1  B  glycoprotein 
Fc-B-2-glycoprotein  (apo  H) 


Figure  1,  MCP  Drake  et  al. 
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Figure  2,  MCP  Drake  et  al. 
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Figure  3,  MCP  Drake  et  al. 


Figure  4,  MCP  Drake  et  al. 
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Figure  5,  MCP  Drake  et  al. 
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